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RAW SEQUENCE LISTING 

PATENT APPLICATION US/08/945,459A 



DATE: 1 1/02/98 
TIME: 1 1 :40:25 



INPUT SET: S29578.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 



1) General Information: 
(i) 



APPLICANT: MAKISHIMA, FUSAO; TAKAMATSU, 
HIROYUKI; MIKI, HIDEO; KAWAI, 
SHINJI; KIMURA, MICHIO; MATSUMOTO, 
TOMOAKI; KATSUURA, MIEKO; ENOMOTO ' 
KOICHI; SATOH, YUSUKE 

(ii) TITLE OF INVENTION: A NOVEL PROTEIN AND 
PROCESS FOR PREPARING THE SAME 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BIERMAN, MUSERLIAN AND LUCAS 
LLP 

(B) STREET: 600 THIRD AVENUE 

(C) CITY: NEW YORK 

(D) STATE: NEW YORK 

(E) COUNTRY: USA 

(F) ZIP: 10016 



%0 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: FLOPPY DISK 

(B) COMPUTER: IBM PC COMPATIBLE 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: MICROSOFT WORD 97 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/945,459 

(B) FILING DATE: 09-DEC-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/ JP96/01062 

(B) FILING DATE: 19-APR-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP7/322403 

(B) FILING DATE: 17-NOV-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP7/93664 

(B) FILING DATE: 19-APR-19 95 
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RAW SEQUENCE LISTING DATE: 1 1/02/98 

PATENT APPLICATION US/08/945,459A TIME: 1 1 :40:27 

INPUT SET: S29578.raw 

(vii) ATTORNEY/AGENT INFORMATION: 



(A) NAME: CHARLES A, MUSERLIAN 

(B) REGISTRATION NUMBER: 19,683 

(C) REFERENCE/DOCKET NUMBER: 146.1275 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 661-8000 

(B) TELEFAX: (212) 661-8002 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 AMINO ACIDS 

(B) TYPE: AMINO ACID 

(C) STRANDEDNESS : 

(D) TOPOLOGY: LINEAR 

(ii) MOLECULE TYPE: PEPTIDE 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMOSAPIENS 
(F) TISSUE TYPE: FETUS 

(ix) FEATURE: 

(A) NAME/KEY: MPS 2 

(B) LOCATION: 383 TO 501 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys 
1 5 10 

Asn Leu Lys Ala Arg Cys Ser Arg Lys Ala Leu His 
15 20 

Val Asn Phe Lys Asp Met Gly Trp Asp Asp Trp lie 
25 30 35 

lie Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu 
40 45 

Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu 
50 55 60 

Pro Thr Asn His Ala Val lie Gin Thr Leu Met Asn 
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RAW SEQUENCE LISTING date- 1 1/02/98 

PATENT APPLICATION US/08/945,459A TIME: 1 1 :4(h28 

INPUT SET: S29S78.raw 

65 70 



Ser Met Asp Pro Glu Ser 
75 

Val Pro Thr Arg Leu Ser 
85 90 

He Asp Ser Ala Asn Asn 
100 

Glu Asp Met Val Val Glu 
110 



Thr Pro Pro Thr Cys Cys 
80 

Pro He Ser He Leu Phe 
95 

Val Val Tyr Lys Gin Tyr 
105 

Ser Cys Gly Cys Arg 
115 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS : SINGLE 

(D) TOPOLOGY: LINEAR 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATAATGCCAC TAGCAACTCG TCAGGGC 27 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: SINGLE 

(D) TOPOLOGY: LINEAR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
CGTCGACTAC CTGCAGCCAC ACGACT 26 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 BASE PAIRS 

(B) TYPE: NUCLEIC ACID 

(C) STRANDEDNESS: DOUBLE 
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PATENT APPLICATION US/08/945,459A TIME: 11:40:30 



INPUT SET: S29578.raw 



15 3 (D) TOPOLOGY: UNKNOWN 
154 

155 
156 
157 
158 
159 
160 

161 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

162 

16 3 CCA CTG GCC ACT CGC CAG GGC AAG CGA CCC AGC AAG 36 

164 Pro Leu Ala Thr Arg Gin Gly Lys Arg Pro Ser Lys 

165 1 5 10 
166 

167 AAC CTT AAG GCT CGC TGC AGT CGG AAG GCA CTG CAT 72 

168 Asn Leu Lys Ala Arg Cys Ser Arg Lys Ala Leu His 

169 15 20 
170 

171 GTC AAC TTC AAG GAC ATG GGC TGG GAC GAC TGG ATC 108 

172 Val Asn Phe Lys Asp Met Gly Trp Asp Asp Trp lie 

173 25 30 35 
174 

175 ATC GCA CCC CTT GAG TAC GAG GCT TTC CAC TGC GAG 144 

176 lie Ala Pro Leu Glu Tyr Glu Ala Phe His Cys Glu 

177 40 45 
178 

17 9 GGG CTG TGC GAG TTC CCA TTG CGC TCC CAC CTG GAG 180 

180 Gly Leu Cys Glu Phe Pro Leu Arg Ser His Leu Glu 

181 50 55 60 
182 

183 CCC ACG AAT CAT GCA GTC ATC CAG ACC CTG ATG AAC 216 

184 Pro Thr Asn His Ala Val lie Gin Thr Leu Met Asn 

185 65 70 
186 

187 TCC ATG GAC CCC GAG TCC ACA CCA CCC ACC TGC TGT 252 

188 Ser Met Asp Pro Glu Ser Thr Pro Pro Thr Cys Cys 

189 75 80 
190 

191 GTG CCC ACG CGA CTG AGT CCC ATC AGC ATC CTC TTC 288 

192 Val Pro Thr Arg Leu Ser Pro lie Ser lie Leu Phe 

193 85 90 95 
194 

195 ATT GAC TCT GCC AAC AAC GTG GTG TAT AAG CAG TAT 324 

196 lie Asp Ser Ala Asn Asn Val Val Tyr Lys Gin Tyr 

197 100 105 
198 

199 GAG GAC ATG GTC GTG GAG TCG TGT GGC TGC AGG 357 

200 Glu Asp Met Val Val Glu Ser Cys Gly Cys Arg 

201 110 115 
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